OM protein - nucleic search, using frame_plus_p2n model



Run on:         April 4, 2003, 16:35:42 ; Search time 1467 Seconds
                                          (without alignments)
                                          2142.538 Million cell updates/sec

Title:          US-09-541-462B-2
Perfect score:  616
Sequence:       1 MAAAMDVDTPSGTNSGAGKK......KTRQVCPLDNREWEFQKYGH 108



Scoring table:  BLOSUM62
                Xgapop 10.0 , Xgapext 0.5
                Ygapop 10.0 , Ygapext 0.5
                Fgapop  6.0 , Fgapext 7.0
                Delop   6.0 , Delext  7.0



Searched:       2054640 seqs, 14551402878 residues

Total number of hits satisfying chosen parameters:      4109280

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Command line parameters: 
-MODEL=frame+_p2n.model -DEV=xlp

Q=/cgn2_l/USPTO_spool/US09541462/runat_31032003_090927_7154/app_query.fasta_l 
3 

-DB=GenEmbl -QFMT=fastap -SUFFIX=std.rge -MINMATCH=0.1 -LOOPCL=0 -LOOPEXT=0
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi -LIST=45
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000
-USER=US09541462_@CGN_1_1_3745_@runat_31032003_090927_7154 -NCPU=6 -ICPU=3
-NO_XLPXY -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -LONGLOG -DEV_TIMEOUT=120
-WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7
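
The command line above is a flat list of `-KEY=VALUE` options and bare switches. As a minimal sketch only, such a list could be loaded into a dictionary so the gap penalties can be compared against the scoring header. The function name `parse_params` is hypothetical and is not part of the search tool.

```python
def parse_params(text):
    """Split an option string into a dict; bare switches map to True."""
    params = {}
    for token in text.split():
        if not token.startswith("-"):
            continue  # ignore anything that is not an option
        # Split on the first "=" only, so negative values such as -END=-1 survive.
        key, sep, value = token[1:].partition("=")
        params[key] = value if sep else True
    return params


# A short excerpt of the options listed in the report.
opts = parse_params("-XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7 -NO_MMAP -END=-1")
print(opts["XGAPEXT"], opts["NO_MMAP"], opts["END"])
```

Values are kept as strings on purpose; converting them to numbers is left to the caller, since the report mixes integers, decimals and file names.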

Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10: gb_ro:*
                11: gb_sts:*
                12: gb_sy:*
                13: gb_un:*
                14: gb_vi:*
                15: [illegible]
                16: em_fun:*
                17: em_hum:*
                18: em_in:*
                19: em_mu:*
                20: em_om:*
                21: em_or:*
                22: em_ov:*
                23: [illegible]
                24: em_ph:*
                25: em_pl:*
                26: [illegible]
                27: [illegible]
                28: em_un:*
                29: em_vi:*
                30: em_htg_hum:*
                31: em_htg_inv:*
                32: em_htg_other:*
                33: em_htg_mus:*
                34: em_htg_pln:*
                35: em_htg_rod:*
                36: em_htg_mam:*
                37: em_htg_vrt:*
                38: em_sy:*
                39: em_htgo_hum:*
                40: em_htgo_mus:*
                41: em_htgo_other:*




Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

                              SUMMARIES

                   %
Result           Query
   No.   Score   Match   Length  DB  ID            Description

     1     616   100.0      327   9  AF142059      AF142059 Homo sapi
     2     616   100.0      504  10  AF140599      AF140599 Mus muscu
     3     616   100.0      508   9  AF140598      AF140598 Homo sapi
     4     616   100.0      544   9  BC001466      BC001466 Homo sapi
     5     616   100.0      554   9  BC017370      BC017370 Homo sapi
     6     616   100.0     1616  10  BC027396      BC027396 Mus muscu
     7     604    98.1     1933   5  AY027936      AY027936 Salmo sal
     8     584    94.8      497   9  HUMYQ60A05    AF085906 Homo sapi
     9     573    93.0     5347   6  AX281690      AX281690 Sequence
    10     554    89.9     3484   9  HSTEST        X73608 H.sapiens m
    11     545    88.5      306   9  AY099360      AY099360 Homo sapi
    12     535    86.9     1181   3  AY119265      AY119265 Drosophil
    13   515.5    83.7      635   8  AY086913      AY086913 Arabidops
    14   515.5    83.7      666   8  AY072430      AY072430 Arabidops
    15   514.5    83.5      357   8  AY052401      AY052401 Arabidops
    16   514.5    83.5      544   8  AY114719      AY114719 Arabidops
    17   501.5    81.4     3208   5  AX217267      AX212267 Sequence
    18   492.5    80.0      361   3  AH077287      AB077287 Caenorhab
    19     469    76.1      324   3  AF179228      AF179228 Schizosac
    20   464.5    75.4    46630   3  [illegible]   AL031581 Drosophil
    21   464.5    75.4   102227   2  AC020129      AC020129 Drosophil
    22   464.5    75.4   161668   3  AC104602      AC104602 Drosophil
    23   464.5    75.4   299970   3  AE003418      AE003418 Drosophil
    24     433    70.3      433  11  G27926        G27926 human STS S
    25     428    69.5    40197   3  CBRG33P21     AC084553 Caenorhab
    26   425.5    69.1   171237   2  [illegible]   AC123343 Rattus no
    27   411.5    66.8    40387 [?]  SPAC23H4      Z98977 S.pombe chr
    28   410.5    66.6   104787   9  AC109638      AC109638 Homo sapi
c   29   402.5    65.3    39874   3  CEZK287       Z70757 Caenorhabdi
    30   400.5    65.0      553   3  AY070810      AY070810 Drosophil
    31   400.5    65.0      750   3  AF218290      [illegible]
c   32   400.5    65.0     3185   3  AY061302      [illegible]
    33   400.5    65.0    95294   2  AC019742      AC019742 Drosophil
    34   400.5    65.0   167977   3  AC010010      [illegible]
    35   400.5    65.0   310120   3  AE003468      [illegible]
    36     385    62.5     1255   3  [illegible]   [illegible]
    37     385    62.5    12805   3  [illegible]   [illegible]
c   38     378    61.4   164399   3  [illegible]   [illegible]
    39   352.5    57.2   173346   9  AC112184      AC112184 Homo sapi
    40   352.5    57.2   175561   9  AC012636      AC012636 Homo sapi
c   41   335.5    54.5    79677   8  ATT21C14      AL138639 Arabidops
c   42   327.5    53.2   177444   2  AC024617      AC024617 Homo sapi
    43     314    51.0   209982   3  CNS06C8G      AL391737 chromosom
    44     292    47.4      342   9  AF142060      AF142060 Homo sapi
c   45     292    47.4      418   6  AX332170      AX332170 Sequence
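
The "% Query Match" values in the summaries appear to be the raw score expressed as a percentage of the perfect score (616) given in the report header; for example, result 7 scores 604 and is listed at 98.1. The short sketch below checks that reading. It is an inference from the listed numbers, not a documented formula, and the name `query_match` is hypothetical.

```python
PERFECT_SCORE = 616  # "Perfect score" from the report header


def query_match(score, perfect=PERFECT_SCORE):
    """Return a score as a percentage of the perfect score."""
    return 100.0 * score / perfect


# Scores taken from summary rows 1, 7, 8 and 44.
for score in (616, 604, 584, 292):
    print(f"{score:>6} -> {query_match(score):.1f}%")
```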



ALIGNMENTS 



RESULT 1
AF142059
LOCUS       AF142059      327 bp    mRNA    linear   PRI 14-JUL-1999
DEFINITION  Homo sapiens RING finger protein (ROC1) mRNA, complete cds.
ACCESSION   AF142059
VERSION     AF142059.1  GI:4809215
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   Ohta,T., Michel,J.J., Schottelius,A.J. and Xiong,Y.
  TITLE     ROC1, a homolog of APC11, represents a family of cullin partners
            with an associated ubiquitin ligase activity
  JOURNAL   Mol. Cell 3 (4), 535-541 (1999)
  MEDLINE   99247022
   PUBMED   10230407
REFERENCE   2  (bases 1 to 327)
  AUTHORS   Ohta,T., Michel,J. and Xiong,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1999) Lineberger Comprehensive Cancer Center,
            University of North Carolina at Chapel Hill, Mason Farm Rd. and
            Manning Dr., Chapel Hill, NC 27599-7295, USA
FEATURES             Location/Qualifiers
     source          1..327
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..327
                     /gene="ROC1"
     CDS             1..327
                     /gene="ROC1"
                     /note="Cullin partnering protein with associated ubiquitin
                     ligase activity"
                     /codon_start=1
                     /product="RING finger protein"
                     /protein_id="AAD30146.1"
                     /db_xref="GI:4809216"
                     /translation="MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAI
                     CRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWE
                     FQKYGH"
BASE COUNT       85 a     75 c     94 g     73 t
ORIGIN

Alignment Scores:
Pred. No.:              5.64e-59    Length:        327
Score:                  616.00      Matches:       108
Percent Similarity:     100.00%     Conservative:  0
Best Local Similarity:  100.00%     Mismatches:    0
Query Match:            100.00%     Indels:        0
DB:                     9           Gaps:          0

US-09-541-462B-2 (1-108) x AF142059 (1-327)

Qy          1 MetAlaAlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLys 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGGCGGCAGCGATGGATGTGGATACCCCGAGCGGCACCAACAGCGGCGCGGGCAAGAAG 60

Qy         21 ArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAsp 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 CGCTTTGAAGTGAAAAAGTGGAATGCAGTAGCCCTCTGGGCCTGGGATATTGTGGTTGAT 120

Qy         41 AsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGln 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 AACTGTGCCATCTGCAGGAACCACATTATGGATCTTTGCATAGAATGTCAAGCTAACCAG 180

Qy         61 AlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHis 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 GCGTCCGCTACTTCAGAAGAGTGTACTGTCGCATGGGGAGTCTGTAACCATGCTTTTCAC 240

Qy         81 PheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 TTCCACTGCATCTCTCGCTGGCTCAAAACACGACAGGTGTGTCCATTGGACAACAGAGAG 300

Qy        101 TrpGluPheGlnLysTyrGlyHis 108
              ||||||||||||||||||||||||
Db        301 TGGGAATTCCAAAAGTATGGGCAC 324

RESULT 2
AF140599
LOCUS       AF140599      504 bp    mRNA    linear   ROD 11-MAY-1999
DEFINITION  Mus musculus ring-box protein 1 (Rbx1) mRNA, complete cds.
ACCESSION   AF140599
VERSION     AF140599.1  GI:4769005
KEYWORDS    .
SOURCE      Mus musculus.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 504)
  AUTHORS   Kamura,T., Koepp,D.M., Conrad,M.N., Skowyra,D., Moreland,R.J.,
            Iliopoulos,O., Lane,W.S., Kaelin,W.G. Jr., Elledge,S.J.,
            Conaway,R.C., Harper,J.W. and Conaway,J.W.
  TITLE     Rbx1, a component of the VHL tumor suppressor complex and SCF
            ubiquitin ligase
  JOURNAL   Science 284 (5414), 657-661 (1999)
  MEDLINE   99234320
   PUBMED   10213691
REFERENCE   2  (bases 1 to 504)
  AUTHORS   Kamura,T., Lane,W.S., Conaway,R.C. and Conaway,J.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-APR-1999) Prog. Molec. Cell Biol., HHMI, Oklahoma
            Med. Res. Fdn., 825 NE 13th St, Oklahoma City, OK 73104, USA
FEATURES             Location/Qualifiers
     source          1..504
                     /organism="Mus musculus"
                     /db_xref="taxon:10090"
     gene            1..504
                     /gene="Rbx1"
     CDS             18..344
                     /gene="Rbx1"
                     /note="component of VHL tumor suppressor complex and SCF
                     ubiquitin ligase"
                     /codon_start=1
                     /product="ring-box protein 1"
                     /protein_id="AAD29716.1"
                     /db_xref="GI:4769006"
                     /translation="MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAI
                     CRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWE
                     FQKYGH"
BASE COUNT      117 a    107 c    137 g    143 t
ORIGIN

Alignment Scores:
Pred. No.:              8.92e-59    Length:        504
Score:                  616.00      Matches:       108
Percent Similarity:     100.00%     Conservative:  0
Best Local Similarity:  100.00%     Mismatches:    0
Query Match:            100.00%     Indels:        0
DB:                     10          Gaps:          0

US-09-541-462B-2 (1-108) x AF140599 (1-504)

Qy          1 MetAlaAlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLys 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         18 ATGGCGGCGGCGATGGATGTGGATACCCCCAGCGGCACCAACAGCGGCGCGGGCAAGAAG 77

Qy         21 ArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAsp 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         78 CGCTTTGAAGTTAAAAAGTGGAATGCAGTGGCCCTCTGGGCCTGGGACATTGTGGTTGAT 137

Qy         41 AsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGln 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        138 AACTGTGCCATCTGCAGGAACCACATTATGGATCTTTGTATCGAATGTCAGGCCAACCAG 197

Qy         61 AlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHis 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        198 GCGTCAGCTACTTCCGAAGAGTGTACGGTTGCATGGGGAGTCTGCAACCATGCTTTTCAT 257

Qy         81 PheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        258 TTCCACTGCATCTCTCGATGGCTCAAAACGAGGCAGGTGTGTCCGTTGGACAACAGAGAG 317

Qy        101 TrpGluPheGlnLysTyrGlyHis 108
              ||||||||||||||||||||||||
Db        318 TGGGAGTTCCAGAAGTATGGGCAT 341



RESULT 3
AF140598
LOCUS       AF140598      508 bp    mRNA    linear   PRI 11-MAY-1999
DEFINITION  Homo sapiens ring-box protein 1 (RBX1) mRNA, complete cds.
ACCESSION   AF140598
VERSION     AF140598.1  GI:4769003
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 508)
  AUTHORS   Kamura,T., Koepp,D.M., Conrad,M.N., Skowyra,D., Moreland,R.J.,
            Iliopoulos,O., Lane,W.S., Kaelin,W.G. Jr., Elledge,S.J.,
            Conaway,R.C., Harper,J.W. and Conaway,J.W.
  TITLE     Rbx1, a component of the VHL tumor suppressor complex and SCF
            ubiquitin ligase
  JOURNAL   Science 284 (5414), 657-661 (1999)
  MEDLINE   99234320
   PUBMED   10213691
REFERENCE   2  (bases 1 to 508)
  AUTHORS   Kamura,T., Lane,W.S., Conaway,R.C. and Conaway,J.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-APR-1999) Prog. Molec. Cell Biol., HHMI, Oklahoma
            Med. Res. Fdn., 825 NE 13th St, Oklahoma City, OK 73104, USA
FEATURES             Location/Qualifiers
     source          1..508
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..508
                     /gene="RBX1"
     CDS             7..333
                     /gene="RBX1"
                     /note="ring finger-like protein; component of VHL tumor
                     suppressor complex and SCF ubiquitin ligase"
                     /codon_start=1
                     /product="ring-box protein 1"
                     /protein_id="AAD29715.1"
                     /db_xref="GI:4769004"
                     /translation="MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAI
                     CRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWE
                     FQKYGH"
BASE COUNT      126 a    106 c    124 g    152 t
ORIGIN

Alignment Scores:
Pred. No.:              8.99e-59    Length:        508
Score:                  616.00      Matches:       108
Percent Similarity:     100.00%     Conservative:  0
Best Local Similarity:  100.00%     Mismatches:    0
Query Match:            100.00%     Indels:        0
DB:                     9           Gaps:          0

US-09-541-462B-2 (1-108) x AF140598 (1-508)

Qy          1 MetAlaAlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLys 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          7 ATGGCGGCAGCGATGGATGTGGATACCCCGAGCGGCACCAACAGCGGCGCGGGCAAGAAG 66

Qy         21 ArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAsp 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         67 CGCTTTGAAGTGAAAAAGTGGAATGCAGTAGCCCTCTGGGCCTGGGATATTGTGGTTGAT 126

Qy         41 AsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGln 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        127 AACTGTGCCATCTGCAGGAACCACATTATGGATCTTTGCATAGAATGTCAAGCTAACCAG 186

Qy         61 AlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHis 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        187 GCGTCCGCTACTTCAGAAGAGTGTACTGTCGCATGGGGAGTCTGTAACCATGCTTTTCAC 246

Qy         81 PheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        247 TTCCACTGCATCTCTCGCTGGCTCAAAACACGACAGGTGTGTCCATTGGACAACAGAGAG 306

Qy        101 TrpGluPheGlnLysTyrGlyHis 108
              ||||||||||||||||||||||||
Db        307 TGGGAATTCCAAAAGTATGGGCAC 330



RESULT 4
BC001466
LOCUS       BC001466      544 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, ring-box 1, clone MGC:1481 IMAGE:3138751, mRNA,
            complete cds.
ACCESSION   BC001466
VERSION     BC001466.1  GI:12655214
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 544)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-DEC-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Shevchenko,Y., Wetherby,K.D., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Guan,X., Gupta,J., Ho,S.-L., Karlins,E., Legaspi,R.,
            Lim,M., Maduro,Q.L., Masiello,C., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Snyder,B., Stantripop,S., Thomas,P.J.,
            Tiongson,E.E., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 4 Row: j Column: 15.
FEATURES             Location/Qualifiers
     source          1..544
                     /organism="Homo sapiens"
                     /db_xref="LocusID:9978"
                     /db_xref="taxon:9606"
                     /clone="MGC:1481 IMAGE:3138751"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             15..341
                     /codon_start=1
                     /product="ring-box 1"
                     /protein_id="AAH01466.1"
                     /db_xref="GI:12655215"
                     /translation="MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAI
                     CRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWE
                     FQKYGH"
BASE COUNT      157 a    107 c    129 g    151 t
ORIGIN

Alignment Scores:
Pred. No.:              9.67e-59    Length:        544
Score:                  616.00      Matches:       108
Percent Similarity:     100.00%     Conservative:  0
Best Local Similarity:  100.00%     Mismatches:    0
Query Match:            100.00%     Indels:        0
DB:                     9           Gaps:          0

US-09-541-462B-2 (1-108) x BC001466 (1-544)

Qy          1 MetAlaAlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLys 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         15 [sequence not legible in source] 74

Qy         21 ArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAsp 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         75 CGCTTTGAAGTGAAAAAGTGGAATGCAGTAGCCCTCTGGGCCTGGGATATTGTGGTTGAT 134

Qy         41 AsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGln 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        135 AACTGTGCCATCTGCAGGAACCACATTATGGATCTTTGCATAGAATGTCAAGCTAACCAG 194

Qy         61 AlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHis 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        195 GCGTCCGCTACTTCAGAAGAGTGTACTGTCGCATGGGGAGTCTGTAACCATGCTTTTCAC 254

Qy         81 PheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        255 [sequence not legible in source] 314

Qy        101 TrpGluPheGlnLysTyrGlyHis 108
              ||||||||||||||||||||||||
Db        315 TGGGAATTCCAAAAGTATGGGCAC 338



RESULT 5
BC017370
LOCUS       BC017370      554 bp    mRNA    linear   PRI 14-NOV-2001
DEFINITION  Homo sapiens, ring-box 1, clone MGC:13357 IMAGE:4065797, mRNA,
            complete cds.
ACCESSION   BC017370
VERSION     BC017370.1  GI:16924201
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 554)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-NOV-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 19 Row: c Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:7657507.
FEATURES             Location/Qualifiers
     source          1..554
                     /organism="Homo sapiens"
                     /db_xref="LocusID:9978"
                     /db_xref="taxon:9606"
                     /clone="MGC:13357 IMAGE:4065797"
                     /tissue_type="Brain, glioblastoma"
                     /clone_lib="NIH_MGC_57"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             23..349
                     /codon_start=1
                     /product="ring-box 1"
                     /protein_id="AAH17370.1"
                     /db_xref="GI:16924202"
                     /translation="MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAI
                     CRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWE
                     FQKYGH"
BASE COUNT      157 a    107 c    133 g    157 t
ORIGIN

Alignment Scores:
Pred. No.:              9.86e-59    Length:        554
Score:                  616.00      Matches:       108
Percent Similarity:     100.00%     Conservative:  0
Best Local Similarity:  100.00%     Mismatches:    0
Query Match:            100.00%     Indels:        0
DB:                     9           Gaps:          0

US-09-541-462B-2 (1-108) x BC017370 (1-554)

Qy          1 MetAlaAlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLys 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         23 ATGGCGGCAGCGATGGATGTGGATACCCCGAGCGGCACCAACAGCGGCGCGGGCAAGAAG 82

Qy         21 ArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAsp 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         83 CGCTTTGAAGTGAAAAAGTGGAATGCAGTAGCCCTCTGGGCCTGGGATATTGTGGTTGAT 142

Qy         41 AsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGln 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        143 AACTGTGCCATCTGCAGGAACCACATTATGGATCTTTGCATAGAATGTCAAGCTAACCAG 202

Qy         61 AlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHis 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        203 GCGTCCGCTACTTCAGAAGAGTGTACTGTCGCATGGGGAGTCTGTAACCATGCTTTTCAC 262

Qy         81 PheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        263 TTCCACTGCATCTCTCGCTGGCTCAAAACACGACAGGTGTGTCCATTGGACAACAGAGAG 322

Qy        101 TrpGluPheGlnLysTyrGlyHis 108
              ||||||||||||||||||||||||
Db        323 TGGGAATTCCAAAAGTATGGGCAC 346



RESULT 6
BC027396
LOCUS       BC027396     1616 bp    mRNA    linear   ROD 07-AUG-2002
DEFINITION  Mus musculus, ring-box 1, clone MGC:35907 IMAGE:4952242, mRNA,
            complete cds.
ACCESSION   BC027396
VERSION     BC027396.1  GI:20072075
KEYWORDS    MGC.
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 1616)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-APR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Jeffrey Green M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgscbcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Hale, S.M.,
            Yoon, V.S., Kowis, C.R., Lawrence, S., Martin, R.G., Muzny, D.M.,
            Richards, S., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 60 Row: l Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:9790190.
FEATURES             Location/Qualifiers
     source          1..1616
                     /organism="Mus musculus"
                     /db_xref="taxon:10090"
                     /map="FVB/N"
                     /clone="MGC:35907 IMAGE:4952242"
                     /tissue_type="Mammary tumor. C3(1)-Tag model. Infiltrating
                     ductal carcinoma. 5 month old virgin mouse."
                     /clone_lib="NCI_CGAP_Mam6"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     CDS             7..333
                     /codon_start=1
                     /product="ring-box 1"
                     /protein_id="AAH27396.1"
                     /db_xref="GI:20072076"
                     /db_xref="LocusID:9978"
                     /translation="MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAI
                     CRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWE
                     FQKYGH"
BASE COUNT      453 a    330 c    407 g    426 t
ORIGIN

Alignment Scores:
Pred. No.:              3.06e-58    Length:        1616
Score:                  616.00      Matches:       108
Percent Similarity:     100.00%     Conservative:  0
Best Local Similarity:  100.00%     Mismatches:    0
Query Match:            100.00%     Indels:        0
DB:                     10          Gaps:          0

US-09-541-462B-2 (1-108) x BC027396 (1-1616)

Qy          1 MetAlaAlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLys 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          7 ATGGCGGCGGCGATGGATGTGGATACCCCCAGCGGCACCAACAGCGGCGCGGGCAAGAAG 66

Qy         21 ArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAsp 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         67 CGCTTTGAAGTTAAAAAGTGGAATGCAGTGGCCCTCTGGGCCTGGGACATTGTGGTTGAT 126

Qy         41 AsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGln 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        127 AACTGTGCCATCTGCAGGAACCACATTATGGATCTTTGTATCGAATGTCAGGCCAACCAG 186

Qy         61 AlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHis 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        187 GCGTCAGCTACTTCCGAAGAGTGTACGGTTGCATGGGGAGTCTGCAACCATGCTTTTCAT 246

Qy         81 PheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        247 TTCCACTGCATCTCTCGATGGCTCAAAACGAGGCAGGTGTGTCCGTTGGACAACAGAGAG 306

Qy        101 TrpGluPheGlnLysTyrGlyHis 108
              ||||||||||||||||||||||||
Db        307 TGGGAGTTCCAGAAGTATGGGCAT 330



RESULT 7
AY027936
LOCUS       AY027936     1933 bp    mRNA    linear   VRT 19-MAR-2002
DEFINITION  Salmo salar hyperosmotic protein 21 mRNA, complete cds.
ACCESSION   AY027936
VERSION     AY027936.1  GI:19067883
KEYWORDS    .
SOURCE      Salmo salar.
  ORGANISM  Salmo salar
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Actinopterygii; Neopterygii; Teleostei; Euteleostei;
            Protacanthopterygii; Salmoniformes; Salmonidae; Salmo.
REFERENCE   1  (bases 1 to 1933)
  AUTHORS   Pan,F., Zarate,J. and Bradley,T.M.
  TITLE     A homolog of the E3 ubiquitin ligase Rbx1 is induced during
            hyperosmotic stress of salmon
  JOURNAL   Am. J. Physiol. Regul. Integr. Comp. Physiol. 282 (2002) In press
REFERENCE   2  (bases 1 to 1933)
  AUTHORS   Bradley,T.M. and Pan,F.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-FEB-2001) Fisheries, Animal and Veterinary Science,
            University of Rhode Island, Bldg.14, East Farm, Route 108,
            Kingston, RI 02881, USA
FEATURES             Location/Qualifiers
     source          1..1933
                     /organism="Salmo salar"
                     /db_xref="taxon:8030"
                     /tissue_type="gill"
     CDS             1135..1704
                     /note="SHOP21; putative E3 ubiquitin ligase; similar to
                     human and murine RBX1 protein"
                     /codon_start=1
                     /product="hyperosmotic protein 21"
                     /protein_id="AAK29182.1"
                     /db_xref="GI:19067884"
                     /translation="MSEGVPSPVCPSGTLAQAHQSPHRLPMEGQAAQAAGSTNVCWLR
                     LKASADHLIHVALRILCNVMHSHIVHLVPNSARAIW
                     FEVKKWNAVALWAWDIVVDNCAICRNHIMDLCIECQANQASATSEECTVAWGVCNHAF
                     HFHCISRWLKTRQVCPLDNREWEFQKYGH"
BASE COUNT      518 a    466 c    443 g    506 t
ORIGIN

Alignment Scores:
Pred. No.:              7.7e-57     Length:        1933
Score:                  604.00      Matches:       106
Percent Similarity:     98.15%      Conservative:  0
Best Local Similarity:  98.15%      Mismatches:    2
Query Match:            98.05%      Indels:        0
DB:                     5           Gaps:          0

US-09-541-462B-2 (1-108) x AY027936 (1-1933)

Qy          1 MetAlaAlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLys 20
              |||||||||||||||||||||||||||||||||   |||||||||||||||   ||||||
Db       1378 ATGGCGGCAGCGATGGATGTTGATACCCCAAGCGCCACGAATAGTGGAGCAAGCAAGAAA 1437

Qy         21 ArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAsp 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1438 CGTTTTGAAGTGAAGAAGTGGAATGCAGTGGCACTTTGGGCCTGGGACATTGTGGTGGAC 1497

Qy         41 AsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGln 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1498 AACTGTGCCATCTGTAGGAATCACATTATGGATCTCTGCATAGAGTGCCAGGCTAACCAG 1557

Qy         61 AlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHis 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1558 GCCTCTGCCACATCAGAGGAGTGCACCGTAGCCTGGGGAGTCTGCAATCATGCATTCCAT 1617

Qy         81 PheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1618 TTCCACTGTATCTCCCGTTGGTTGAAGACCAGGCAGGTGTGCCCCCTAGACAACAGGGAG 1677

Qy        101 TrpGluPheGlnLysTyrGlyHis 108
              ||||||||||||||||||||||||
Db       1678 TGGGAGTTTCAGAAATATGGACAC 1701
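
The alignment blocks pair each three-letter query residue with one database codon. As a rough sketch of how such a block can be checked by hand, the code below translates the first database line of Result 7 and marks a codon with `|||` only when its translation equals the query residue. It assumes the standard genetic code and an in-frame, ungapped alignment; how the search program itself handles frameshifts or conservative substitutions is not covered, and the helper names `translate` and `match_line` are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string into one-letter amino acids."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def match_line(query, dna):
    """Build the marker line: '|||' per identical residue, blanks otherwise."""
    return "".join("|||" if q == p else "   " for q, p in zip(query, translate(dna)))


query = "MAAAMDVDTPSGTNSGAGKK"  # query residues 1-20
db = "ATGGCGGCAGCGATGGATGTTGATACCCCAAGCGCCACGAATAGTGGAGCAAGCAAGAAA"  # Db 1378-1437

print(translate(db))
print(match_line(query, db))
```

The two blank codons produced for this block fall at query residues 12 and 18, which agrees with the "Mismatches: 2" entry reported for Result 7.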



HUMYQ60A05 497 bp mRNA linear PRI 29-AUG-1998 

Homo sapiens full length insert cDNA clone YQ60A05. 
AF085906 

AF085906.1 GI:3483246 
FLI_CDNA. 
Homo sapiens . 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 497) 

Woessner, J. ( Tan,F., Marra,M., Kucaba,T., Yandell,M., Martin, J., 
Marth,G., Bowles, L., Wylie,T., Bowers, Y., Steptoe,M., Theising,B., 
Geisel,S., Allen, M. f Underwood, K. , Chappell , J. , Person, B., 
Gibbons , M. , Harvey, N. , Pape,D., Chamberlain, A. , Morales, R. , 
Schurk,R., Ritter,E., Kohn,S., Swaller,T., Behymer,K., Hillier,L., 
Wilson, R. and Waterston,R. 

Full Clone Sequencing of the Longest Available Member from Each 
Unigene Cluster 
Unpublished 

2 {bases 1 to 497) 
Waterston,R. 
Direct Submission 

Submitted (24-AUG-1998) Department of Genetics, Washington 
University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA 
SUBMITTED BY: 
Genome Sequencing Center 
Department of Genetics 
Washington University 
St. Louis MO 63108, USA 
http:/ /genome .wustl .edu/gsc 
mailto : est@watson. wustl . edu 

NOTICE: This sequence represents the full insert of this cDNA. No 
attempt has been made to verify whether this corresponds to the 
full-length of the original mRNA from which it was derived. We 
have tried to obtain double -stranded, or double chemistry sequence 
across the entire clone, but potentially, there are areas in the 
sequence where this level of coverage was not achieved. 
Nevertheless, we are confident of the accuracy of this sequence as 
all regions of low quality, as defined by PHRAP {P. Green, in 
preparation) , were visually inspected and edited accordingly. The 
consensus quality values for this sequence have been submitted 
separately. 

SIMILARITY INFORMATION: 

similar to Caenorhabditis elegans protein 270757 (PID :gl262999) 
ZK287 .5 

The location of this clone is unknown. 
FEATURES Location/Qualif iers 

source 1. .4 97 

/organism="Homo sapiens" 
/ db_xre f = t axon : 9 6 0 6 " 
/clone=" IMAGE: 200144" 



RESULT 8 
HUMYQ60A05 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



/clone_lib="Soares_fetal_liver_spleen_1NFLS"
misc_feature 3..305
/note="similar to Caenorhabditis elegans protein Z70757
(PID:g1262999)"
misc_feature 18..284
/note="similar to Caenorhabditis elegans protein U80449
(PID:g1707068)"
misc_feature 36..302
/note="similar to Saccharomyces cerevisiae protein S66830
(PID:g2132017)"
misc_feature 42..302
/note="similar to Schizosaccharomyces pombe protein Z98977
(PID:g2388937)"
misc_feature 51..284
/note="similar to Caenorhabditis elegans protein Z46242
(PID:g559430)"



BASE COUNT 131 a 101 c 117 g 148 t
ORIGIN








Alignment Scores:

Pred. No.: 2.87e-55 Length: 497

Score: 584.00 Matches: 101

Percent Similarity: 100.00% Conservative: 0

Best Local Similarity: 100.00% Mismatches: 0

Query Match: 94.81% Indels: 0

DB: 9 Gaps: 0

US-09-541-462B-2 (1-108) x HUMYQ60A05 (1-497)





Qy   8 AspThrProSerGlyThrAsnSerGlyAlaGlyLysLysArgPheGluValLysLysTrp 27
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3 GATACCCCGAGCGGCACCAACAGCGGCGCGGGCAAGAAGCGCTTTGAAGTGAAAAAGTGG 62

Qy  28 AsnAlaValAlaLeuTrpAlaTrpAspIleValValAspAsnCysAlaIleCysArgAsn 47
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  63 AATGCAGTAGCCCTCTGGGCCTGGGATATTGTGGTTGATAACTGTGCCATCTGCAGGAAC 122

Qy  48 HisIleMetAspLeuCysIleGluCysGlnAlaAsnGlnAlaSerAlaThrSerGluGlu 67
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 123 CACATTATGGATCTTTGCATAGAATGTCAAGCTAACCAGGCGTCCGCTACTTCAGAAGAG 182

Qy  68 CysThrValAlaTrpGlyValCysAsnHisAlaPheHisPheHisCysIleSerArgTrp 87
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 183 TGTACTGTCGCATGGGGAGTCTGTAACCATGCTTTTCACTTCCACTGCATCTCTCGCTGG 242

Qy  88 LeuLysThrArgGlnValCysProLeuAspAsnArgGluTrpGluPheGlnLysTyrGly 107
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 243 CTCAAAACACGACAGGTGTGTCCATTGGACAACAGAGAGTGGGAATTCCAAAAGTATGGG 302

Qy 108 His 108
       |||
Db 303

RESULT 9
AX281690/c
LOCUS AX281690 5347 bp DNA linear PAT 02-NOV-2001
DEFINITION Sequence 99 from Patent WO0177389.
ACCESSION AX281690
VERSION AX281690.1 GI:16608941
KEYWORDS
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1
AUTHORS Shiffman,D., Somogyi,R., Lawn,R., Seilhamer,J.J., Porter,G.J.,
Mikita,T. and Tai,J.
TITLE Genes expressed in foam cell differentiation
JOURNAL Patent: WO 0177389-A 99 18-OCT-2001;
Incyte Genomics, Inc. (US)
FEATURES Location/Qualifiers
source 1..5347
/organism="Homo sapiens"
/db_xref="taxon:9606"
/note="Incyte ID No: 364940.19"
BASE COUNT 1489 a 1277 c 1094 g 1487 t
ORIGIN



Alignment Scores: 

Pred. No.: 5.74e-53 Length: 5347 

Score: 573.00 Matches: 103 

Percent Similarity: 99.04% Conservative: 0 

Best Local Similarity: 99.04% Mismatches: 0 

Query Match: 93.02% Indels: 1

DB: 6 Gaps: 0 

US-09-541-462B-2 (1-108) x AX281690 (1-5347) 
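
The Query Match percentages in this listing appear to equal the alignment score divided by the perfect score of 616 given in the run header. That relationship is inferred from the printed numbers, not taken from the search tool's documentation, so the short Python sketch below should be read as a consistency check under that assumption. The function name `query_match` is made up for the illustration.

```python
# Perfect score as printed in the run header of this report.
PERFECT_SCORE = 616.0


def query_match(score, perfect=PERFECT_SCORE):
    """Return the score as a percentage of the perfect score (2 decimals).

    Assumption: this is how the report derives "Query Match"; it is
    inferred from the listed values only.
    """
    return round(100.0 * score / perfect, 2)


# (hit, Score, Query Match as printed in the report)
reported = [
    ("HUMYQ60A05", 584.00, 94.81),
    ("AX281690", 573.00, 93.02),
    ("HSTEST", 554.00, 89.94),
    ("AY099360", 545.00, 88.47),
    ("AY119265", 535.00, 86.85),
    ("AY086913", 515.50, 83.69),
    ("AY052401", 514.50, 83.52),
]

for hit, score, printed in reported:
    print(f"{hit:12s} computed {query_match(score):6.2f}  printed {printed:6.2f}")
```

Under the same assumption, a Query Match below 100% for a hit with 100% similarity (for example HSTEST) simply reflects that the alignment covers only part of the 108-residue query.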



Qy   3 AlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAla-GlyLysLysArgPh 22
       ||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||
Db 515

Qy  22 eGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAspAsnCy 42
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 455 TGAAGTGAAAAAGTGGAATGCAGTAGCCCTCTGGGCCTGGGATATTGTGGTTGATAACTG 396

Qy  42 sAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGlnAlaSe 62
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 395 TGCCATCTGCAGGAACCACATTATGGATCTTTGCATAGAATGTCAAGCTAACCAGGCGTC 336

Qy  62 rAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHisPheHi 82
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 335 CGCTACTTCAGAAGAGTGTACTGTCGCATGGGGAGTCTGTAACCATGCTTTTCACTTCCA 276

Qy  82 sCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGluTrpGl 102
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 275 CTGCATCTCTCGCTGGCTCAAAACACGACAGGTGTGTCCATTGGACAACAGAGAGTGGGA 216

Qy 102
Db 215



RESULT 10
HSTEST/c
LOCUS HSTEST 3484 bp mRNA linear PRI 01-MAY-1995
DEFINITION H. sapiens mRNA for testican.
ACCESSION X73608
VERSION X73608.1 GI:793844
KEYWORDS testican.
SOURCE Homo sapiens.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3484)
AUTHORS Alliel,P.M., Perin,J.P., Jolles,P. and Bonnet,F.J.
TITLE Testican, a multidomain testicular proteoglycan resembling
modulators of cell social behaviour
JOURNAL Eur. J. Biochem. 214 (1), 347-350 (1993)
MEDLINE 93285162
PUBMED 8389704
FEATURES Location/Qualifiers
source 1..3484
/organism="Homo sapiens"
/db_xref="taxon:9606"
/clone_lib="lambda gt11"
CDS 435..1754
/codon_start=1
/product="testican"
/protein_id="CAA51999.1"
/db_xref="GI:793845"
/db_xref="SPTREMBL:Q08629"
/translation="MPAIAVLAAAAAAWCFLQVESRHLDALAGGAGPNHGNFLDNDQW
LSTVSQYDRDKYWNRFRDDDYFRNWNPNKPFDQALDPSKDPCLKVKCSPHKVCVTQDY
QTALCVSRKHLLPRQKKGNVAQKHWVGPSNLVKCKPCPVAQSAMVCGSDGHSYTSKCK
LEFHACSTGKSLATLCDGPCPCLPEPEPPKHKAERSACTDKELRNLASRLKDWFGALH
EDANRVIKPTSSNTAQGRFDTSILPICKDSLGWMFNKLDMNYDLLLDPSEINAIYLDK
YEPCIKPLFNSCDSFKDGKLSNNEWCYCFQKPGGLPCQNEMNRIQKLSKGKSLLGAFI
PRCNEEGYYKATQCHGSTGQCWCVDKYGNELAGSRKQGAVSCEEEQETSGDFGSGGSV
VLLDDLEYERELGPKDKEGKLRVHTRAVTEDDEDEDDDKEDEVGYIW"
BASE COUNT 910 c 808 g 872 t
ORIGIN



Alignment Scores: 

Pred. No.: 4.45e-51 Length: 3484 

Score: 554.00 Matches: 98 

Percent Similarity: 100.00% Conservative: 0 

Best Local Similarity: 100.00% Mismatches: 0 

Query Match: 89.94% Indels: 0 

DB: 9 Gaps: 0

US-09-541-462B-2 (1-108) X HSTEST (1-3484) 



Qy   3 AlaAlaMetAspValAspThrProSerGlyThrAsnSerGlyAlaGlyLysLysArgPhe 22
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 296 GCAGCGATGGATGTGGATACCCCGAGCGGCACCAACAGCGGCGCGGGCAAGAAGCGCTTT 237

Qy  23 GluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIleValValAspAsnCys 42
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 236 GAAGTGAAAAAGTGGAATGCAGTAGCCCTCTGGGCCTGGGATATTGTGGTTGATAACTGT 177

Qy  43 AlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGlnAlaAsnGlnAlaSer 62
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 176 GCCATCTGCAGGAACCACATTATGGATCTTTGCATAGAATGTCAAGCTAACCAGGCGTCC 117

Qy  63 AlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHisAlaPheHisPheHis 82
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 116 GCTACTTCAGAAGAGTGTACTGTCGCATGGGGAGTCTGTAACCATGCTTTTCACTTCCAC 57

Qy  83 CysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAspAsnArgGlu 100
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  56 TGCATCTCTCGCTGGCTCAAAACACGACAGGTGTGTCCATTGGACAACAGAGAG 3



RESULT 11
AY099360
LOCUS AY099360 306 bp mRNA linear PRI 13-MAY-2002
DEFINITION Homo sapiens ZYP protein mRNA, partial cds.
ACCESSION AY099360
VERSION AY099360.1 GI:20502054
KEYWORDS
SOURCE Homo sapiens.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 306)
AUTHORS Perin,J.-P., Seddiqi,N., Charbonnier,F., Goudou,D., Belkadi,L.,
Rieger,F. and Alliel,P.M.
TITLE Genomic organization and expression of the ubiquitin-proteasome
complex-associated protein Rbx1/ROC1/Hrt1
JOURNAL Cell. Mol. Biol. (Noisy-le-grand) 45 (8), 1131-1137 (1999)
MEDLINE 20106778
PUBMED 10643962
REFERENCE 2 (bases 1 to 306)
AUTHORS Alliel,P.M., Seddiqi,N., Belkadi,L., Lecoeur,L. and Perin,J.P.
TITLE Direct Submission
JOURNAL Submitted (23-APR-2002) U488, INSERM, 80, rue du General Leclerc,
Le Kremlin-Bicetre 94270, France
FEATURES Location/Qualifiers
source 1..306
/organism="Homo sapiens"
/db_xref="taxon:9606"
/clone="zyp3"
/sex="male"
/tissue_type="whole brain"
/dev_stage="26-week fetus"
/note="Caucasian"
CDS <1..290
/note="similar to RBX1/ROC1/HRT1"
/codon_start=3
/product="ZYP protein"
/protein_id="AAM21718.1"
/db_xref="GI:20502055"
/translation="NSGASKKRFEVKKWNAVALWAWDIVVDNCAICRNHIMDLCIECQ
ANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWEFQKYGH"
BASE COUNT 84 a 70 c 80 g 72 t
ORIGIN



Alignment Scores:

Pred. No.: 3.29e-51 Length: 306

Score: 545.00 Matches: 94

Percent Similarity: 98.95% Conservative: 0

Best Local Similarity: 98.95% Mismatches: 1

Query Match: 88.47% Indels: 0

DB: 9 Gaps: 0



US-09-541-462B-2 (1-108) X AY099360 (1-306) 



Qy  14 AsnSerGlyAlaGlyLysLysArgPheGluValLysLysTrpAsnAlaValAlaLeuTrp 33
       ||||||||||||   |||||||||||||||||||||||||||||||||||||||||||||
Db   3 AACAGCGGCGCGAGCAAGAAGCGCTTTGAAGTGAAAAAGTGGAATGCAGTAGCCCTCTGG 62

Qy  34 AlaTrpAspIleValValAspAsnCysAlaIleCysArgAsnHisIleMetAspLeuCys 53
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  63 GCCTGGGATATTGTGGTTGATAACTGTGCCATCTGCAGGAACCACATTATGGATCTTTGC 122

Qy  54 IleGluCysGlnAlaAsnGlnAlaSerAlaThrSerGluGluCysThrValAlaTrpGly 73
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 123 ATAGAATGTCAAGCTAACCAGGCGTCCGCTACTTCAGAAGAGTGTACTGTCGCATGGGGA 182

Qy  74 ValCysAsnHisAlaPheHisPheHisCysIleSerArgTrpLeuLysThrArgGlnVal 93
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 183 GTCTGTAACCATGCTTTTCACTTCCACTGCATCTCTCGCTGGCTCAAAACACGACAGGTG 242

Qy  94 CysProLeuAspAsnArgGluTrpGluPheGlnLysTyrGlyHis 108
       |||||||||||||||||||||||||||||||||||||||||||||
Db 243 TGTCCATTGGACAACAGAGAGTGGGAATTCCAAAAGTATGGGCAC 287
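
Each Db row in these alignments is a nucleotide stretch shown beneath the three-letter translation it is compared against, so a row can be spot-checked by translating it. The Python sketch below is an illustration only: it assumes the standard genetic code (the search itself was run with a `human40.cdi` translation table, which is not reproduced here), and the helper names `translate` and `reverse_complement` are hypothetical. It translates the final Db row of the AY099360 alignment (bases 243 to 287).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}


def translate(dna):
    """Translate DNA in frame 1 into concatenated three-letter codes."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3)
    )


def reverse_complement(dna):
    """Reverse-complement a DNA string (A<->T, C<->G), reading it backwards."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Last Db row of the AY099360 alignment (Db 243..287).
db_row = "TGTCCATTGGACAACAGAGAGTGGGAATTCCAAAAGTATGGGCAC"
print(translate(db_row))
```

Hits whose identifier ends in `/c` (for example HSTEST/c) list Db coordinates in descending order, which suggests the match is on the complementary strand; for a sequence taken from the database record itself, `reverse_complement` would be applied before `translate`.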



RESULT 12
AY119265
LOCUS AY119265 1181 bp mRNA linear INV 16-JUN-2002
DEFINITION Drosophila melanogaster SD23839 full insert cDNA.
ACCESSION AY119265
VERSION AY119265.1 GI:21430893
KEYWORDS FLI_CDNA.
SOURCE fruit fly.
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
Ephydroidea; Drosophilidae; Drosophila.
REFERENCE 1 (bases 1 to 1181)
AUTHORS Stapleton,M., Brokstein,P., Hong,L., Agbayani,A., Carlson,J.,
Champe,M., Chavez,C., Dorsett,V., Dresnek,D., Farfan,D., Frise,E.,
George,R., Gonzalez,M., Guarin,H., Kronmiller,B., Li,P., Liao,G.,
Miranda,A., Mungall,C.J., Nunoo,J., Pacleb,J., Paragas,V., Park,S.,
Patel,S., Phouanenavong,S., Wan,K., Yu,C., Lewis,S.E., Rubin,G.M.
and Celniker,S.
TITLE Direct Submission
JOURNAL Submitted (07-JUN-2002) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road,
Berkeley, CA 94720, USA
COMMENT Sequence submitted by:
Berkeley Drosophila Genome Project
Lawrence Berkeley National Laboratory
Berkeley, CA 94720

This clone was sequenced as part of a high-throughput process to
sequence clones from Drosophila Gene Collection 1 (Rubin et al.,
Science 2000). The sequence has been subjected to integrity checks
for sequence accuracy, presence of a polyA tail and contiguity
within 100 kb in the genome. Thus we believe the sequence to
reflect accurately this particular cDNA clone. However, there are
artifacts associated with the generation of cDNA clones that may
have not been detected in our initial analyses such as internal
priming, priming from contaminating genomic DNA, retained introns
due to reverse transcription of unspliced precursor RNAs, and
reverse transcriptase errors that result in single base changes.
For further information about this sequence, including its location
and relationship to other sequences, please visit our Web site
(http://fruitfly.berkeley.edu) or send email to
cdna@fruitfly.berkeley.edu.
FEATURES Location/Qualifiers
source 1..1181
/organism="Drosophila melanogaster"
/db_xref="taxon:7227"
gene 1..1181
/gene="Roc1a"
/db_xref="FLYBASE:FBgn0025638"
CDS 162..488
/gene="Roc1a"
/note="Longest ORF"
/codon_start=1
/product="SD23839p"
/protein_id="AAM51125.1"
/db_xref="GI:21430894"
/db_xref="FLYBASE:FBgn0025638"
/translation="MEVDEDGYEVPSSSSKGDKKRFEVKKWNAVALWAWDIVVDNCAI
CRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQVCPLDNREWD
FQKYGH"



BASE COUNT 346 a 246 c 288 g 301 t
ORIGIN








Alignment Scores:

Pred. No.: 1.73e-49 Length: 1181

Score: 535.00 Matches: 95

Percent Similarity: 90.09% Conservative: 5

Best Local Similarity: 85.59% Mismatches: 5

Query Match: 86.85% Indels: 6

DB: 3 Gaps: 2



US-09-541-462B-2 (1-108) X AY119265 (1-1181) 



Qy   3 AlaAlaMetAspValAsp---------------ThrProSerGlyThrAsnSerGlyAla 17
       :::   |||:::||||||                  ||||||   ::::::   |||
Db 156 TCCACCATGGAAGTCGACGAGGATGGATACGAGGTTCCCTCCAGCAGCAGCAAGGGC--- 212

Qy  18 GlyLysLysArgPheGluValLysLysTrpAsnAlaValAlaLeuTrpAlaTrpAspIle 37
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 213 GATAAGAAGCGCTTTGAGGTGAAGAAGTGGAACGCCGTGGCTCTGTGGGCCTGGGACATC 272

Qy  38 ValValAspAsnCysAlaIleCysArgAsnHisIleMetAspLeuCysIleGluCysGln 57
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 273 GTGGTGGACAACTGCGCCATCTGCCGCAACCACATCATGGACTTGTGCATCGAGTGTCAG 332

Qy  58 AlaAsnGlnAlaSerAlaThrSerGluGluCysThrValAlaTrpGlyValCysAsnHis 77
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 333 GCGAACCAGGCGTCCGCCACTAGCGAGGAGTGCACCGTGGCCTGGGGCGTCTGCAACCAC 392

Qy  78 AlaPheHisPheHisCysIleSerArgTrpLeuLysThrArgGlnValCysProLeuAsp 97
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 393 GCCTTCCATTTCCACTGCATCTCTCGCTGGCTAAAGACGCGCCAGGTATGCCCACTGGAC 452

Qy  98 AsnArgGluTrpGluPheGlnLysTyrGlyHis 108
       ||||||||||||:::||||||||||||||||||
Db 453 AACCGCGAGTGGGATTTCCAGAAGTACGGCCAC 485



RESULT 13
AY086913
LOCUS AY086913 635 bp mRNA linear PLN 26-JUN-2002
DEFINITION Arabidopsis thaliana clone 29408 mRNA, complete sequence.
ACCESSION AY086913
VERSION AY086913.1 GI:21405628
KEYWORDS FLI_CDNA.
SOURCE thale cress.
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis.
REFERENCE 1 (bases 1 to 635)
AUTHORS Haas,B.J., Volfovsky,N., Town,C.D., Troukhan,M., Alexandrov,N.,
Feldmann,K.A., Flavell,R.B., White,O. and Salzberg,S.L.
TITLE Full-length messenger RNA sequences greatly improve genome
annotation
JOURNAL Genome Biol. (2002) In press
REFERENCE 2 (bases 1 to 635)
AUTHORS Brover,V., Troukhan,M., Alexandrov,N., Lu,Y.-P., Flavell,R. and
Feldmann,K.
TITLE Full-Length cDNA from Arabidopsis thaliana
JOURNAL Unpublished
REFERENCE 3 (bases 1 to 635)
AUTHORS Brover,V., Troukhan,M., Alexandrov,N., Lu,Y.-P., Flavell,R. and
Feldmann,K.
TITLE Direct Submission
JOURNAL Submitted (11-MAR-2002) Ceres, Inc., 3007 Malibu Canyon Road,
Malibu, CA 90265, USA
COMMENT This clone sequence is one of 5,000 Ceres full-length cDNAs made
available to TIGR and Genbank. The following quality assessment of
this set was done by comparison with known proteins: two percent of
the clones are estimated to be 5'-truncated; less than one percent
are 3'-truncated; approximately two percent represent alternative
splice variants, including unspliced introns and spliced exons; one
percent may contain premature stop codons; five percent may have
frame shifts in a coding region. A sequence is considered to be
5'-truncated if it lacks the translation initiation start (ATG). A
sequence is considered to be 3'-truncated if it lacks the
C-terminal end of the encoded protein. Please note that these cDNA
sequences are derived from the Ws or LAer ecotypes and therefore
may contain polymorphisms when compared to sequences from Col-0.
Genset carried out the library production and sequencing of the
full-length clones. Ceres, Inc. carried out the clustering of the
5' sequences, selection of clones, and sequence assembly.
FEATURES Location/Qualifiers
source 1..635
/organism="Arabidopsis thaliana"
/db_xref="taxon:3702"
/clone="29408"
CDS 91..447
/codon_start=1
/product="ring-box protein-like"
/protein_id="AAM64477.1"
/db_xref="GI:21592528"
/translation="MATLDSDVTMIPAGEASSSVAASSSNKKAKRFEIKKWSAVALWA
WDIVVDNCAICRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQ
VCPLDNSEWEFQKYGH"
BASE COUNT 164 a 135 c 144 g 192 t
ORIGIN



Alignment Scores: 

Pred. No.: 1.24e-47 Length: 635 

Score: 515.50 Matches: 92 

Percent Similarity: 83.19% Conservative: 7 

Best Local Similarity: 77.31% Mismatches: 9 

Query Match: 83.69% Indels: 11 

DB: 8 Gaps: 2 

US-09-541-462B-2 (1-108) X AY086913 (1-635) 

Qy   1 MetAlaAlaAlaMetAspValAspThr---------ProSerGly--------------- 12
       :::   |||   :::|||   |||            |||:::|||
Db  88 TTAATGGCGACTCTAGACTCCGACGTTACCATGATTCCTGCCGGAGAAGCCTCCAGCAGC 147

Qy  13 ---------ThrAsnSerGlyAlaGlyLysLysArgPheGluValLysLysTrpAsnAla 29
                ::::::|||            ||||||||||||:::|||||||||:::|||
Db 148 GTAGCCGCGTCGTCTTCCAACAAGAAAGCTAAGCGATTCGAAATTAAGAAGTGGAGCGCC 207

Qy  30 ValAlaLeuTrpAlaTrpAspIleValValAspAsnCysAlaIleCysArgAsnHisIle 49
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 208 GTTGCTCTCTGGGCTTGGGATATCGTTGTTGACAACTGTGCGATCTGCAGAAACCACATC 267

Qy  50 MetAspLeuCysIleGluCysGlnAlaAsnGlnAlaSerAlaThrSerGluGluCysThr 69
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 268 ATGGATCTTTGTATCGAGTGTCAGGCTAATCAGGCCAGTGCCACAAGTGAAGAGTGCACT 327

Qy  70 ValAlaTrpGlyValCysAsnHisAlaPheHisPheHisCysIleSerArgTrpLeuLys 89
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 328 GTAGCTTGGGGGGTTTGCAATCACGCCTTCCACTTTCACTGCATCAGCAGATGGCTAAAG 387

Qy  90 ThrArgGlnValCysProLeuAspAsnArgGluTrpGluPheGlnLysTyrGlyHis 108
       |||||||||||||||||||||||||||   |||||||||||||||||||||||||||
Db 388 ACTCGTCAAGTTTGTCCATTGGATAACAGTGAGTGGGAGTTTCAGAAATATGGTCAC 444



RESULT 14
AY072430
LOCUS AY072430 666 bp mRNA linear PLN 21-JAN-2002
DEFINITION Arabidopsis thaliana ring-box protein-like (At5g20570) mRNA,
complete cds.
ACCESSION AY072430
VERSION AY072430.1 GI:18252990
KEYWORDS FLI_CDNA.
SOURCE Arabidopsis thaliana.
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis.
REFERENCE 1 (bases 1 to 666)
AUTHORS Nguyen,M., Karlin-Neumann,G., Southwick,A., Lam,B., Miranda,M.,
Palm,C.J., Bowser,L., Jones,T., Banh,J., Carninci,P., Chen,H.,
Cheuk,R., Chung,M.K., Hayashizaki,Y., Ishida,J., Kamiya,A.,
Kawai,J., Kim,C., Lin,J., Liu,S.X., Narusaka,M., Pham,P.K.,
Sakano,H., Sakurai,T., Satou,M., Seki,M., Shinn,P., Yamada,K.,
Shinozaki,K., Ecker,J., Theologis,A. and Davis,R.W.
TITLE Direct Submission
JOURNAL Submitted (02-JAN-2002) DNA Sequencing and Technology Center,
Stanford University, 855 California Avenue, Palo Alto, CA 94304,
USA
COMMENT e-mail for correspondence: arab@sequence.stanford.edu

RIKEN Genomic Sciences Center (GSC) members carried out the
collection and clustering of RAFL cDNAs (RAFL cDNA: 'RIKEN
Arabidopsis Full-Length cDNA'): Seki,M., Narusaka,M., Ishida,J.,
Satou,M., Kamiya,A., Sakurai,T., Carninci,P., Kawai,J.,
Hayashizaki,Y. and Shinozaki,K.

The Salk, Stanford, PGEC (SSP) Consortium members carried out the
sequencing and annotation of the RAFL cDNAs: Nguyen,M.,
Southwick,A., Karlin-Neumann,G., Lam,B., Miranda,M., Palm,C.J.,
Bowser,L., Jones,T., Banh,J., Chen,H., Cheuk,R., Chung,M.K.,
Kim,C., Lin,J., Liu,S.X., Pham,P.K., Sakano,H., Shinn,P.,
Yamada,K., Ecker,J., Theologis,A. and Davis,R.W.

Nguyen,M. (SSP/Stanford) and Seki,M. (RIKEN GSC) contributed
equally to this work. Shinozaki,K. (RIKEN GSC) and Davis,R.W.
(SSP/Stanford) contributed equally to this work as PIs.
FEATURES Location/Qualifiers
source 1..666
/organism="Arabidopsis thaliana"
/db_xref="taxon:3702"
/chromosome="5"
/clone="RAFL09-78-L12"
/note="This clone is in pBluescript
ecotype: Columbia"
gene 1..666
/gene="At5g20570"
CDS 88..444
/gene="At5g20570"
/codon_start=1
/product="ring-box protein-like"
/protein_id="AAL62422.1"
/db_xref="GI:18252991"
/translation="MATLDSDVTMIPAGEASSSVAASSSNKKAKRFEIKKWSAVALWA
WDIVVDNCAICRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQ
VCPLDNSEWEFQKYGH"
BASE COUNT 181 a 137 c 149 g 199 t
ORIGIN



Alignment Scores:

Pred. No.: 1.3e-47 Length: 666

Score: 515.50 Matches: 92

Percent Similarity: 83.19% Conservative: 7

Best Local Similarity: 77.31% Mismatches: 9

Query Match: 83.69% Indels: 11

DB: 8 Gaps: 2



US-09-541-462B-2 (1-108) X AY072430 (1-666) 



Qy   1 MetAlaAlaAlaMetAspValAspThr---------ProSerGly--------------- 12
       :::   |||   :::|||   |||            |||:::|||
Db  85 TTAATGGCGACTCTAGACTCCGACGTTACCATGATTCCTGCCGGAGAAGCCTCCAGCAGC 144

Qy  13 ---------ThrAsnSerGlyAlaGlyLysLysArgPheGluValLysLysTrpAsnAla 29
                ::::::|||            ||||||||||||:::|||||||||:::|||
Db 145 GTAGCCGCGTCGTCTTCCAACAAGAAAGCTAAGCGATTCGAAATTAAGAAGTGGAGCGCC 204

Qy  30 ValAlaLeuTrpAlaTrpAspIleValValAspAsnCysAlaIleCysArgAsnHisIle 49
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 205 GTTGCTCTCTGGGCTTGGGATATCGTTGTTGACAACTGTGCGATCTGCAGAAACCACATC 264

Qy  50 MetAspLeuCysIleGluCysGlnAlaAsnGlnAlaSerAlaThrSerGluGluCysThr 69
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 265 ATGGATCTTTGTATCGAGTGTCAGGCTAATCAGGCCAGTGCCACAAGTGAAGAGTGCACT 324

Qy  70 ValAlaTrpGlyValCysAsnHisAlaPheHisPheHisCysIleSerArgTrpLeuLys 89
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 325 GTAGCTTGGGGGGTTTGCAATCACGCCTTCCACTTTCACTGCATCAGCAGATGGCTAAAG 384

Qy  90 ThrArgGlnValCysProLeuAspAsnArgGluTrpGluPheGlnLysTyrGlyHis 108
       |||||||||||||||||||||||||||   |||||||||||||||||||||||||||
Db 385 ACTCGTCAAGTTTGTCCATTGGATAACAGTGAGTGGGAGTTTCAGAAATATGGTCAC 441



RESULT 15
AY052401
LOCUS AY052401 357 bp mRNA linear PLN 16-OCT-2
DEFINITION Arabidopsis thaliana ring box-1-like protein mRNA, complete cds.
ACCESSION AY052401
VERSION AY052401.1 GI:16186264
KEYWORDS
SOURCE Arabidopsis thaliana.
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyt
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis.
REFERENCE 1 (bases 1 to 357)
AUTHORS Okresz,L.
TITLE Direct Submission
JOURNAL Submitted (25-AUG-2001) Institute of Plant Biology, Biological
Research Center, Temesvari krt. 62, Szeged H-6701, Hungary
FEATURES Location/Qualifiers
source 1..357
/organism="Arabidopsis thaliana"
/db_xref="taxon:3702"
CDS 1..357
/note="RBX1-2"
/codon_start=1
/product="ring box-1-like protein"
/protein_id="AAL13435.1"
/db_xref="GI:16186265"
/translation="MATLDSDVTMIPAGEASSSVAASSSNKKAKRFEIKKWSAVALWA
WDIVVDNCAICRNHIMDLCIECQANQASATSEECTVAWGVCNHAFHFHCISRWLKTRQ
VCPLDNSEWEFQKYGH"
BASE COUNT 83 c 94 g 89 t
ORIGIN



Alignment Scores: 

Pred. No.: 8.67e-48 Length: 357 

Score: 514.50 Matches: 92 

Percent Similarity: 83.76% Conservative: 6 

Best Local Similarity: 78.63% Mismatches: 8 

Query Match: 83.52% Indels: 11

DB: 8 Gaps: 2

US-09-541-462B-2 (1-108) x AY052401 (1-357) 



Qy   3 AlaAlaMetAspValAspThr---------ProSerGly--------------------- 12
       |||   :::|||   |||            |||:::|||
Db   4 GCGACTCTAGACTCCGACGTTACCATGATTCCTGCCGGAGAAGCCTCCAGCAGCGTAGCC 63

Qy  13 ---ThrAsnSerGlyAlaGlyLysLysArgPheGluValLysLysTrpAsnAlaValAla 31
          ::::::|||            ||||||||||||:::|||||||||:::|||||||||
Db  64 GCGTCGTCTTCCAACAAGAAAGCTAAGCGATTCGAAATTAAGAAGTGGAGCGCCGTTGCT 123

Qy  32 LeuTrpAlaTrpAspIleValValAspAsnCysAlaIleCysArgAsnHisIleMetAsp 51
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 124 CTCTGGGCTTGGGATATCGTTGTTGACAACTGTGCGATCTGCAGAAACCACATCATGGAT 183

Qy  52 LeuCysIleGluCysGlnAlaAsnGlnAlaSerAlaThrSerGluGluCysThrValAla 71
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 184 CTTTGTATCGAGTGTCAGGCTAATCAGGCCAGTGCCACAAGTGAAGAGTGCACTGTAGCT 243

Qy  72 TrpGlyValCysAsnHisAlaPheHisPheHisCysIleSerArgTrpLeuLysThrArg 91
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 244 TGGGGGGTTTGCAATCACGCCTTCCACTTTCACTGCATCAGCAGATGGCTAAAGACTCGT 303

Qy  92 GlnValCysProLeuAspAsnArgGluTrpGluPheGlnLysTyrGlyHis 108
       |||||||||||||||||||||   |||||||||||||||||||||||||||
Db 304 CAAGTTTGTCCATTGGATAACAGTGAGTGGGAGTTTCAGAAATATGGTCAC 354
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